Writing a Natural Language Data Base System

نویسندگان

  • David L. Waltz
  • Bradley A. Goodman
چکیده

We present a model for processing English requests for information from a relational data base. The model has as its main steps (a) locating semantie constituents of a request; (b) matching these constituents against larger templates called concept case frames; (c) filling in the concept case frame using information from the user's request, from the dialogue context and from the user's responses to questions posed by the system; and (d) generating a formal data base query using the collected information. Methods are suggested for constructing the components of such a natural language processing system for an arbitrary relational data base. The model has been applied to a large data base of aircraft flight and maintenance data to generate a system called PLANES; examples are drawn from this system. 1. Introduct ion The language processing model for the PLANES system for natural language access to a large data base [9,18-21] has evolved over the past two years to a point where we feel that it is now practical to begin constructing such systems for other data bases. Key ideas and assumptions in the model are described below. The model is designed to handle requests by real, casual users, whose only programming language is English, but who have some knowledge of the material in the data base.t We have assumed that users will ask questions which are often ungrammatical, which include many abbreviations, both standard and non-standard, and which use ellipsis (omission of information to be understood in context) and pronouns extensively. (See Malhotra [13] for ideas about the types of things users are likely to type in.) The model is designed to work with a rela-tional model |3,8], Data is viewed as being divided into relations which correspond to files or sets of files in conventional data base terminology. Each relation contains a collection of tuples which correspond to records; each tuple contains one or more domains or fields. A relation can conveniently be thought of as a table, tin the PLANES systems, we have provided easily accessible HELP files to bring a user without data base knowledge to a point where he can use the rest of the system. with each row being a tuple and each column a domain. There are two important reasons for using the relational approach: (1) The relational approach stresses data independence. This means that the user and front end programs are effectively isolated from …

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Writing and Related Problems for EFL Students

ESL students who write in English may present written material in a rhetorical and organizational mode that reflects the pattern which is valued in their native culture and rhetoric. Considering the violation of English code of writing in the writings of Iranian students, we will notice one common characteristic: They are reluctant (or ignorant of) to write a unified paragraph. Their writing co...

متن کامل

The Effect of System-Nested, Genre-Oriented, Structurally-Mediated Model (SGSM) of Writing Instruction, and Swalesian Model (SM) upon Iranian Learners’ Writing Performance: A Comparative Study

The purpose of the present study was first to offer a tentative solution to the problems observed in writing pedagogy in Iran by devising a more comprehensive approach to genre-based writing instruction. In the second phase, a quasi-experimental research design was adopted to determine how effective the model was in writing instruction, compared with the traditional, product-oriented approach, ...

متن کامل

Filtering Software Specifications Written in Natural Language

The speciication phase is one of the most important and least supported part of the software development process. We have conceived SAREL (Assistance System for Writing Software Speciication in Natural Language) as a tool to improve the speciication phase. SAREL is a continuation of a program of research and development called LESD (Linguistic Engineering for Software Design). The purpose of SA...

متن کامل

An Overview of the MIZAR Project

The Mizar project is a long-term eeort aimed at developing software to support a working mathematician in preparing papers. A. Trybulec, the leader of the project, has designed a language for writing formal mathematics. The logical structure of the language is based on a natural deduction system developed by Ja skowski. The texts written in the language are called Mizar articles and are organiz...

متن کامل

The Relationship between First and Second Language Literacy in Writing

This paper explores the ways in which the transfer of assumptions from first language (L1) writing can help the process of writing in second language (L2). In learning second language writing skills, learners have two primary sources from which they construct a second language system: knowledge and skills from first language and input from second language. To investigate the relative impact of ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1977